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.ABSTRACT ' ^ " . ' . 

This study was designed to explore relationships 
between type and token frequencies and contextual position, effeci^s; 
specifically^' the major question was whether or not vowel cluster 
pronunciation preferences of adult readers were more affected by 
frequency of occurrence than by graphemic environment. Two opposing 
hypotheses were tested regarding four vowel digraph spellings. Six 
synthetic words were constructed for each vowel cluster according to 
contextual and word position constraints. The subjects were 51 

• undergraduates whose task was to read the synthetic words and note 
how they pronounced the underlined vowel cluster. Three models were 
constructed to assess the hypotheses and to predict responses for 
each vowel cluster. The models were a final consonant model, a 
variant type-trOken model, and an .invariant princifjal response mode^ 
Several data analysis techniques were used. The final consTonant model 
was superior to the other two models, but it was found that other 
factors, not yet assessed, were preset in the results. (Author) 
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The mission of the Wisconsin Research and Development Center 
for Cognitive Learning 'is to help learners develop as rapidly 
and effectively as possible their potential as human beings 
and as contributing members of society. The R&D Center is 
striving to fulfill this goal by 

• conducting research to discover more about 
how children learn ' 

• developing improved instructional strategies, 
processes and^materials for school administrators, 
teachers, and children, and 

• offering assistance to educators and citizens 
which will help transfer the outcomes of research 
and development into practice 
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The activities of the Wis&nsin R&D Center are organized 
around one unifying theme. Individually Guided Education. 
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INTRODUCTION " ' 

•t 

\ • 

■ English letter-sound correspondence patterns fall into three dis- 
tinct categories (VenezHy, 1974) . Some are invariant or nearly invciriant 
(b H'' /b/, k A/f and m. /m/) , some are variant but predictable 
(c /s/ before or ^, but c_ A/ otherwise) , and others are 

variant and unpredictable (ch /c/ , /s/, or A/ as in church, chef, 
chorus ; and ea_ /i/, /e/, or /e/ as in team , bread , great ) , Recently 
several studies Iwre excimined * the pronunciation strategies of good^ and 
poor readers for DOth invaricint and variant-predic'table letter-sound 
correspomences (Venezky, ChajSrnan, & Calfee, 1972; Venezky & Johnson, 
1973) . Only dne major study, however, has reported on vciriant- 
unpredictable correspondences (Johnson* 1970) . ' 

Johnson (1970) compared the relaljacrnships of type and token pro- 
nunciation frequencies of real words that contained vowel digraphs' to 
the pronunciations by elementary school children of synthetic words 
that contained the same digraphs. Johnson compiled word type frequencies 
from , a list based on the 20,000 roost common words in the Thomdike . Century 
Senior Dictionary (1941), and he tabulated word token frequencies from 
tHe top 1,000 words in the Brown University cjorpus (Ku^era & Francis, 

1967) .1 : 

Johnson's results showed tha't subjects' responses were mUch more 
closely related to word type proportions than to word token proportions. 
The. most common (principal) pronunciations based on woi'd types appecired 
to be the best predictors ^f vowel cluster pronunciations by children. 
Furthermore, (1) good readers consistently gave moire principal pronuncia- 
tiorrs than poor readers; (2) the percentage , of principal pronunciations 
increased §^om second, to fourth, to sixth grade; and (3) suburban 
children scoreji higher on principal' pjronunciations than urban and rural 
children. - 

Since" previous ^research had indicated that contextual -features may 
influence pronunciation preferences (Calfee, Venezky, & Chapman, 1969) , 
these features were also examined by Johnson (1970) . The pronunciations 
of some vowel clusters, although considered unpredictable, were indeed 
affected by contextual features or by the cluster's position within the 
word. For example, when Xe^ preceded s^, it received the /ai/ pronuncia- 
tion moire frequently than the princ^ipal ^/i/ pronunciation. The reverse 



Word types are distinct words, so that each different word has the 
same weight regardless of its frequency of occurtence in speech or 
print. Word tokens are distinct occurrences of a word, based upon 
total occurrences in printed texts or in speech. 
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was true when ie_ preceded k and when it was in final position. Like- 
wise, oo was more often pronounced /u/ when f6llowed by t, but received * 
the A// pronunciation more frequently when followed by k7 Similarly, 
ow usually received the /au/ pronunciation, but was more likely to be 
pronounced /o/ when in final position. These contextusil-positional 
effj^cts seemed to be highly related to real word occ\irrences . 

The present study was designed to explore further the relation- 
3 between type and token frequencies and contextual-positional 
^s. The basic question being asked was whether the vowel cluster 
^^ation preferences of adult readers are more affected by frequency 
ir^ence features than by graphemic environment. Based upon earlier 
findings, \ the specific environmental feature selected for examination 
was the fc^llowing consonant . ' Two opposing hypotheses for how adult 
readers would pronounce vowel clusters in synthetic English words (i.e., 
in English-like words) were postulated: 

^ H^. The pronunciations will vary according to the number of 

real words that have the same clusters in the same graphemic 
contexts (i.e., the same following consonant "and position). 

H^. The prbiixinciations will vary according to the type or token 
counts of real words with the same clusters, regardless of 
graphemic context. 

To test these hypotheses, four vowqI digraph spellings were selected 
on the basis of theiir variant, but, seemingly patterned, pronunciations 
in a corpus of ^high frequency words. The digraph spellings selected 
were oo, ou, ea) and ow? their pronunciation^ in common monosyllabic 
and disyllabic words are summarized as follows.^ 



oo is usually /u/ before n, m, or 1 ( moon , broom, fool ) and 
i Ar/ before k or d ( book , good ) , but about equally /u/ 
and A>/ before t (boot, foot). 



ou is usually /au/ before n, t, and d ( ground ,- trout , loud) 
and /a/ before £, b, or ch (couple, trouble, touch) . ^ 

is usually /i/ before k or t (beak, hgat) and /e/ before 
th or 1^ ( heather , wealth) , but about equally /i/ and /e/ 
before d or n ( plead , dead, mean) . 



ea 



ow 



is usually /o/ in- final position ( grow , slow, snow) and 
. /au/ before d ( chowder , crowd) , but about equally /o/ and 
/au/ before n ( grown , crown ) . 

Table 1 presents the principal pronunciations of each of the four 
vowel clusters by word type frequency. and by word token frequency 
This table shows that for two of the selected vowel^clusters, ou and 
ea, the principal pronunciations for word types and word tokenF^are 

2~ ' X 

^This. summary is based upon unpublished data on the letter-sound ^ 
correspondences of |the 20,000 most common words in English. 

■ 0 / 



TABLE 1 



WORD TYPE AND WORD TOKEN PRINCIPAL PRONUNCIATIONS 
AND PERCENTAGES FOR THE VOWEL CLUST^jftS-^ . 
00, OU, EA, AND OW* 



i/pal 



Word Types 



Word Tokens 



Principal 





Pronx^nci^tion 


Percentage 


Pr^onunciation , 


Percentage 


GO 


/u/ 

« 


62.2 


Xr/ 


50.0 


ou 


' /au/ 


50.1 


/au/ 


i6.4 


ea 


/i/ 


53. l"^ 


/i/ 


/ 51. A 


OW . 


/o/ 


51.2 

— ' '■ 


/au/ * 


51.4 


*Data 


from Johnson' (1970) . 




t 






the same, while for 
tions for word types 


the other two, oo 
and word tokens 


and OW, the principal pronuncia- 
are different. 
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METHOD • • 

STIMULI V 

For each of the four selected vowel clusters six synthetic words 
were constructed according to the contextual and word position con- 
straints discussed previously. Each, stimulus was then matched with 
four real word alternatives; one alternative contained the principal 
pronunciation of the vowel digraph in the synthetic word, and the other 
th^ee contained frequently pccuTring pronunciations of this digraph. 
The 24 test items were then randomized and printj.ed.on an 8-1/2 x ll 
sheet of paper (see the. Appendix) • 

SUBJECTS ^ " " ' ' 

The 51 subjects were undergraduates at the University of Wisconsin. 
Twenty-seven were enrolled in a beginning course in computer program- 
ming and 24 were enrolled in a beginning course- in elementary education 

PROCEDURE 

Subjects were told to read a synthetic word to themselves and to 
note how they pronounced the underlined vowel digraph. They then were 
to circle the real word in the same ^ow that contained the same vowel 
sound. The test was group administered (separately to the two groups), 
(emd subjects were encouraged to work as quickly as th^y could. 
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Ill • . 

RESULTS . ■' * 

, " ; . • 

A sunnnary of the responses of' each group to each digraph, summed 
across test items for each digraph, is shown in Table 2. Separate 
chi-square analyses of the digraph responses comparing the tot^l num- 
ber of principal r.esponses to nonprincipal respxDnses by group showed 
a significant group difference for oo^ (x^ = 0.68, df_ = 1, £ < .01) 
but no significant group differences for the other three digraphs. 
Consequently, the two groups were combined for subsequent analyses. 

The respor>ses to each test item^ suinmed across the subject groups', 
are shown in Tabl^ 3. For all six ea test items, a single pronuncia- 
tion t/i/) dominated; there was only a slight deviation in' the responses 
to theat.^ The response /u/' dominated for all oo items except yook , 
which received twice as many A// responses as /u/ responses. ( Flood , 
although assigned the dominant pronunciation by 64 percent of the sub- 
jects, received a .relatively high number of AV responses — 10 of 51.) 

Both ou and ow, although assigned dominant pronunciations (/au/ 
for both) in 69 percent and 67 percent of the cases, respectively, 
showed 'more variation in response patterns thcin did the other two 
digraphs. For ow, an /o/ pro^iunciation was selected by 80 percent of 
the subjects for smow , by 37 percent for drow ^ and by 21 percent^ for 
towd . In all, the /o/ pronunciation accounted for cibout 29 percerpt 
of the responses to ow words. For ou, an /u/ pronunciation was selected 
by 53 percent of the subjects for thoup , by 27 percent for loun , and 
by 16 percent for soud . In all, .^e /u/ pronunciation accoiJntpd for 
about 20 percent of the response's to ou test item.s. 

. For assessing the various hypotheses stated earliier, thyree models 
were constructed and then used to predict the nun±)er of principal 
responses to be expected for eagh vowel digraphs The models tested 
were : 

1. Final consonant model . For each test item, all of the mono- 
syllabic words in the Thorndike word list <see Johnson, 1970) 
» that had the same vowel and consonanfending as th$ test item 

(or vowel plus juncture in the case of smow and drow ) were 
listed by vowel pronunciation. The percentage of these 
* words that contained principal pronunciation for the test 

word digraph was then used to predict the number of/principal 
responses that would occur for the test item. / 

U • 
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TABLE 2 ■ ? 

- . 

PAGES OF RESPCWSES BY SUBJECT GROUP 







AND 


DIGRAPH 

> 












ea - 










*; 6 

\ 




/V 




/a/ 


/e/ 


Group 1 


(Edu.) 


86.8 


11.8 


a, 


r7 


0.7 


Group 2 


(C.S.) 


93*. 2 


5.6 


0 


.0 


* 1.2 


:-y ' v Total 




90 . 2 


O.J 


0 


.3 


1.0 




• • 


oo 














/u/ 


/V/ 


/a/ 


/o/ 


. Group 1 


(Edu. ) 


66.7 


' 16.7 


6 


.3 ■ 


10.4 


V Group 2 


(C.S.) 


82.7 


11.1 


2. 


5 


. 3.7 


Total 




75.2 




4. 


3 


6.9 






ou 
















/a/ 


/V/ 


• /A/ 


, Group 1 


(Edu..) 


74.3 


■ ■ 14.6 


5. 


6 


5.6 


^ Group 2 


(C.S.) 


64.6 


24.2 


6. 


2 


5.0 


Total 




69.2 


19.7 


5. 


9 


5.3 






ow 














/au/ 


/o/ 


/a/ 


/a/ 


. Group 1 


(Edu.) 


7oa 


25.7 


2. 


1 


2.1 


Giroup-2 


(C.^.) 


63.6 


31;5 


3. 


1 


1.9 


* ' Total 

r 




66.7 


28:8 


2. 


6 


2.0 



2. 



Variant t ype-token model .. The higher of the type and token 
percentages for the principal response for each digraph was 
used to predict the number of responses to all stimuli within 
a digraph group. ^Token percentages were based upon the top 
1,000 words in Kucera and Francis (1967), while type per- 
centages were based upon the Thorhdike list mentioned 
previously. . 



3. 



Invarian t principal response model . This model assumes that 
all responses to a digraph will- be the principal response for 
that digraph. 



The. predictions made by earh model for feach stimulus item are 
reported in Table 4^ which also shows the actual responses made by 
the combined subject populations. (Since no English words end, in 
-oub, the test item doub was eliminated from the comparison of models, 
leaving 23^ stimuli in four digraph groups.) 
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TABLE 3 



TOTAJi RESPONSES TO EACH TEST ITEM 
- BY VOWEL- PRONUNCIATION 
(N = 51) 



ERIC 



ea 




/w 


/a / 

/a/ 


/e/ 


yeath 


A C 

45 


A 






brean 


* 4o 


J 




u 


theat 


*38 


13 


0 


0 


pleal 


48 


1 


L 


1 


glead 


48 


3 


0 


0 


feak 


49 


2 


0 


0 


oo " 


/u/ 


/V/ 


/o/ 


/A/ 


foon 
nool 


50 
• 46 


%0 . 
2 ■ 


0 
3 


1 

0 


tooin 


46 


2 


2 


1 


- yook 


16 

♦ 


• 32 


3 


0 


plood 


32 


3 


6 


10 


doot 


40, 


_ . , 3 


7 


1 

\ 


ow 


/au/ 


-/o/ 


/a/ 




smow 


9 . 


41 


1 


U 


bown 


44 


7 


0 


0 


drow • 


32 19 • 


0 


0 


trowd 


41 




1 


3 


towd 


36 


'll 


4 


0 


pown 


42 


4 


2 


3 


ou 


/au/ 


/u/ 


/A/ 


/v/ 


doub* 


33 


4 


8 


5 


front 


' 49 


1 

s 


0 


1 


thoup 


' 19 


27 


1 


4 


loun 


33 


14 


3 


1 


rouch 


41 


6 


0 


4 


soud 


36 


8 


4 


3 


*One subject did not 


respond to 


this' 4-tem. 
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TABLE 4 

THE NUMBER OF PREDICTED AND ACTUAL PRINCIPAL RESPONSES 

TO ^CH TEST ITEM 



Word 


Model 1 


Model 2 ^ 


Model 3 


Actual 




> 

w J. 


TO •< 


51 


50 


i ^ 




Jz 


51 


46 


stoom 


SI 


9 


bl 


46- 




c 






16 






JZ 


51 


32 


\Ji\J\J u 




JZ 


51 


40 




Z J. 


zo ^ 


51 


' 9 


bown 




9A 
zo 


bl 


44 




^ 91 
Z J. 


zo 


51 , 


32 




0 J. 


Oil 

zo 


51 


-41 






ZO. 


51 


36 






26 


51 


42 




^ D J. 


Oil 

zo 


51 


49 




. U 


26 


51 


19 


loun 


51 


26 


51 


33 


rouch 


38 




51 


41 


soud 


51 


26 


51 


36 


yeath 


4X 


' 29 


51 


45 


brean 


51 ' 


29 ' 


51 


48 


the at 


43 


X 29 


51 


38 


pleal 


51 


29 


^ 


48 


glead 


18 


29 


51 


48 


feak 


44 


29 


51 


49 



IS 
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To compare the three mode-Is, several different measures were 
used. The following were used in the analyses. 



xik the response of the ith student to the kth word. 

(xik is 1 if the principal response is given, and 
zero otherwise.) 

-S^km the predicted number of .principal responses to the 
kth word under the mth hypothesis. 
\ 

yk the actual number of principal responses to the kth 

word (y, = E . X. ' ) . 
k 1 ik 

The measures used to compare the three models were as follows. 



Sum of absolute values of differences, based on group 
scores. This required the computation of 



A = I 



m 



km 



for each of the three models and was probkbly the 
weakest tes't that was run. The resulting values 
were = .218, A2 = 314, and A3 = 287. According 
to'thes^ results, model 1 (final consonant) had the 
least amount of error, followed by model 3 (invariant 
principal) and then by model 2 (variant type-token) . 

Sum of squares of absolute diffferences, based on 
group scores. This was similar to analysis 1, but 
gave higher weight to the more deviant resul'ts. 
For 



Sm = Z. 



the resulting values were S-j^ = 3338, S2 = 4982, and 
S3 = 6283. Model 1 thus remained in the same posi- 
tion as in analysis 1, but models 2 and 3 changed 
places. 



3. Analysis of variance--sum of differences. This re- 
quired the computation of 



Imi k 

where W 



W 



ikm 



ikm 



= X., - y^/51. 
ik km 



2]jni was a measure of the error made under. the mth hypothesis 
for the ith student. Then, an analysis of variance was performed to 
test the hypothesis that U^^ = U^^ " ^13' ^^^^^ = E Z^^. 
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This comparison, using repeated measures, showed a significant 
difference among the means JfF ( 2/49) = 128.6508, £< .0001]. Pair- 
wise contrasts showed that model 3 was superior to model 1 and that 
model 1 was superior to model 2. , 



4\ 



Analysis of variance — sum of faquares-t This was 
analogous to analysis 3, but (was bas^*^ upon 



'I 



2mi 



4 



ikm" 



Once again, a significant difference between the ^ * 
means was foiand [F(2/49) =^15.4 £ < .001].' At the 
.05 level, modeU 1 was significantly better than 
model 2 and model 3, while model 2 was better, but 
not significantly better, than model 3. 

Several other analysis techniques were attempted. Two of these 
analyses were similar to analyses 3 and 4, but included the random 
assignment of the subjects into 17' groups of three siobjecits each. 
These latter two analyses both showed model 1 to be superior but were 
not consistent in the ordering of models 2 and 3. 
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[>ISCU3SI0N 



That the responses to certain words such as yook , thoup , and smc 
deviated from the principal responses in: a manner 'that tended to be 
predictable on the basis of following consonant indicates that^the 
simple type/token ntodels in which all test items' in a digraph cliass 
are assigned the same expected frequency for the principal response 
afe inadequate. Following consonant had a definite influence over 
the responses that the siabjects gave, even though it is evident from 
this stvidy that final consor^ant is not th^ oi;ily factor that .influences 
pronunciation. The fiiial consonant model was superior in both compari- 
sons based on deviations of- group scores from predicted scores (analyse 
1 and 2) and in the analysis of variance based on the square of the 
individual variations from the predicted scores (analysis* 4) . Only 
in the analysis of variance based on absolute values of individual 
deviations (analysis 3) was the final consonant model not superior 
to the other two. The change in the position of the final consonant 
model frqm analysis 3 to analysis 4, however, implies that the final 
consonant model has less variance than the other two models. 

We conclude from these cuialyses that the final consonant model 
is indeed superior to the other two models, but that other factors 
' which have yet to be assessed also are present in the results. A 
inodel th^t might provide a higher degree of predictability than the 
nlodels used here is a final consonant model based on token coxints 
rather than on^ type counts. This \model would be especially effective 
if the final consonant influence derives from ^alogy with a few high 
frequency words rather than from a generalization ba^d on all real 
words that contaip a particular spelling. At the same\time, the 
influence of initial consonant* cannot be rejected, especially in 
light of the different response patterns that smow and drow elicited. 
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1. 


yeath 


2. 


smbw 


3. 


bown 




foon 


V 

> 


doub 


6. 


brean 


7. 


^ theat 


8. 


frout 


9. 


drow 


10. 


trowd 


11. 


thoup 


12. 


loun 


13. 


pleal 


14. 


tbwd 


15. 


nool 


16. 


stoom 


17. 


rouch 


18. 


yook ' 


19^ 




20. 


glead 


21. 


plood 


22. 


feak 


23. 


doot 


24. 


soud 




\ 



y 



meet 


.bed 


ton 




about 


boat 


ton 


mop 


ton 


about 


boat 


mop 


put 


blue' 


tOJl 1 


7 boat 


put 


ton 


' cow- 


blue 


pla^ 


bed 


meet • 


ton 


bed 


ton 


J 
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